Relational Sequence Alignment

نویسندگان

  • Andreas Karwath
  • Kristian Kersting
چکیده

The need to measure sequence similarity arises in information extraction, music mining, biological sequence analysis, and other domains, and often coincides with sequence alignment: the more similar two sequences are, the better they can be aligned. Aligning sequences not only shows how similar sequences are, it also shows where there are differences and correspondences between the sequences. Traditionally, the alignment has been considered for sequences of flat symbols only. Many real world sequences such as protein secondary structures, however, exhibit a rich internal structures. This is akin to the problem of dealing with structured examples studied in the field of inductive logic programming (ILP). In this paper, we propose to use wellestablished ILP distance measures within alignment methods. Although straight-forward, our initial experimental results show that this approach performs well in practice and is worth to be explored.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sequence Alignment as a Database Technology Challenge

Sequence alignment is an important task for molecular biologists. Because alignment basically deals with approximate string matching on large biological sequence collections, it is both data intensive and computationally complex. There exist several tools for the variety of problems related to sequence alignment. Our first observation is that the term ’sequence database’ is used in general for ...

متن کامل

An Application of the ABS LX Algorithm to Multiple Sequence Alignment

We present an application of ABS algorithms for multiple sequence alignment (MSA). The Markov decision process (MDP) based model leads to a linear programming problem (LPP), whose solution is linked to a suggested alignment. The important features of our work include the facility of alignment of multiple sequences simultaneously and no limit for the length of the sequences. Our goal here is to ...

متن کامل

gpALIGNER: A Fast Algorithm for Global Pairwise Alignment of DNA Sequences

Bioinformatics, through the sequencing of the full genomes for many species, is increasingly relying on efficient global alignment tools exhibiting both high sensitivity and specificity. Many computational algorithms have been applied for solving the sequence alignment problem. Dynamic programming, statistical methods, approximation and heuristic algorithms are the most common methods appli...

متن کامل

Progressive Alignment Facilitates Learning of Deterministic But Not Probabilistic Relational Categories

Kotovsky and Gentner (1996) showed that presenting progressively aligned examples helped children discover relational similarities: Comparisons based on initially concrete and highly similar, but progressively more abstract exemplars helped the discovery of higher-order relational similarities. We investigated whether progressive alignment can aid learning of relational categories with either a...

متن کامل

Using relational databases to analyze Microarray probes and single nucleotide Polymorphisms

Microarrays such as those from the Affymetrix Incprovide a very useful means of studying thousands of genes for DNA analysis and expression levels and are also valuable in the study of single nucleotide polymorphisms (SNPs). While the physical use of gene expression microarrays involving the assessment of expression levels by 'washing' the arrays with extracted mRNA is their primary purpose, th...

متن کامل

Influenza sequence and epitope database

Influenza epidemics arise through the acquisition of viral genetic changes to overcome immunity from previous infections. An increasing number of complete genomes of influenza viruses have been sequenced in Asia in recent years. Knowledge about the genomes of the seasonal influenza viruses from different countries in Asia is valuable for monitoring and understanding of the emergence, migration ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006